Corpus: hin_wikipedia_2007_10K

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K         # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
100                5             34              87             96             98
1000              25            246             703            942            988
10000            151           1835            4979           8212           9522
100000           151           1836            4980           8213           9523
1000000          151           1836            4980           8213           9523
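
The following is a minimal sketch of how counts of this kind could be computed. It is not the tool that produced the table. It assumes that each figure is the number of distinct N-grams formed by the last N tokens of a sentence, and that sentences are tokenized on whitespace; neither assumption is stated in the report. The function name ending_ngram_counts and the toy sentences are hypothetical.

```python
def ending_ngram_counts(sentences, k, max_n=5):
    """Count distinct N-grams (N = 1..max_n) formed by the last N tokens
    of each of the first k sentences."""
    seen = {n: set() for n in range(1, max_n + 1)}
    for sentence in sentences[:k]:
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            if len(tokens) >= n:  # skip sentences shorter than N tokens
                seen[n].add(tuple(tokens[-n:]))
    return [len(seen[n]) for n in range(1, max_n + 1)]


# Hypothetical toy input, not taken from the corpus.
toy = [
    "the cat sat on the mat .",
    "a dog lay on the mat .",
    "a bird flew over the house .",
]
print(ending_ngram_counts(toy, k=3))  # -> [1, 2, 2, 2, 3]
```

Under these assumptions each count can never exceed K, and it stops growing once K exceeds the number of sentences in the corpus.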


Zipf's diagram for sentence endings

[Gnuplot diagram not reproduced]
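
As a rough illustration of the data behind a Zipf diagram, the sketch below ranks sentence endings by frequency. It assumes the diagram plots frequency against rank, which the report does not state explicitly, and it again assumes whitespace tokenization. The function name ending_rank_frequency and the sample sentences are hypothetical.

```python
from collections import Counter


def ending_rank_frequency(sentences, n=1):
    """Return (rank, frequency) pairs for the N-grams that end sentences,
    ordered from most to least frequent."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        if len(tokens) >= n:
            counts[tuple(tokens[-n:])] += 1
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))


# Hypothetical sample, not taken from the corpus.
sample = ["a b .", "c d .", "e f ?", "g h .", "i j !", "k l ?"]
print(ending_rank_frequency(sample))  # -> [(1, 3), (2, 2), (3, 1)]
```

Plotting these pairs with both axes on a logarithmic scale gives the usual Zipf view, in which an approximately straight line indicates a power-law relation between rank and frequency.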

Generated in 1339 msec on 2017-12-22 17:38.